Inferring Hierarchical Clustering Structures by Deterministic Annealingby Deterministic Annealing

نویسندگان

  • Thomas Hofmann
  • Joachim M. Buhmann
چکیده

The unsupervised detection of hierarchical structures is a major topic in unsupervised learning and one of the key questions in data analysis and representation. We propose a novel algorithm for the problem of learning decision trees for data clustering and related problems. In contrast to many other methods based on successive tree growing and pruning, we propose an ,aL”G.,P 4Lnrt;nn C.-e hM ,a~“lr,lt;n” selrl WP clP~iVc= 1 “YJ~~“‘.U &UYI”I”Y A”. “III ” .uIUYYIVY -.. . . u x.1*. . u Y non-greedy technique for tree growing. Applying the principles of maximum entropy and minimum cross entropy, a deterministic annealing algorithm is derived in a meanfield approximation. This technique allows us to canonically superimpose tree structures and to fit parameters to averaged or ‘fuzzified’ trees.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inferring Hierarchical Clustering Structures by Deterministic Annealing

The unsupervised detection of hierarchical structures is a major topic in unsupervised learning and one of the key questions in data analysis and representation. We propose a novel algorithm for the problem of learning decision trees for data clustering and related problems. In contrast to many other methods based on successive tree growing and pruning, we propose an objective function for tree...

متن کامل

Constrained Clustering as an Optimization Method

Our deterministic annealing approach to clustering is derived on the basis of the principle of maximum entropy, is independent of the initial state, and produces natural hierarchical clustering solutions by going through a sequence of phase transitions. This approach i s modified here for a larger class of optimization problems by adding constraints to the free energy. The concept of constraine...

متن کامل

Hierarchical Pairwise Data

Partitioning a data set and extracting hidden structure arises in diierent application areas of pattern recognition, data analysis and image processing. We formulate data clustering for data characterized by pairwise dissimilarity values as an assignment problem with an objective function to be minimized. An extension to tree{structured clustering is proposed which allows a hierarchical groupin...

متن کامل

Distributional Similarity, Phase Transitions and Hierarchical Clustering

We describe a method for automatically clustering words according to their distribution in particular syntactic contexts. Words are represented by the relative frequency distributions of contexts in which they appear, and relative entropy is used to measure the dissimilarity of those distributions. Clusters are represented by "typical" context distributions averaged from the given words accordi...

متن کامل

Improved clustering using deterministic annealing with a gradient descent technique

Various techniques exist to solve the non-convex optimization problem of clustering. Recent developments have employed a deterministic annealing approach to solving this problem. In this letter a new approximation clustering algorithm, incorporating a gradient descent technique with deterministic annealing, is described. Results are presented for this new method, and its performance is compared...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999